Dynamic Online Pricing with Incomplete Information
نویسندگان
چکیده
Consider the pricing decision for a manager at large online retailer, such as Amazon.com, that sells millions of products. The pricing manager must decide on real-time prices for each of these product. Due to the large number of products, the manager must set retail prices without complete demand information. A manager can run price experiments to learn about demand and maximize long run profits. There are two aspects that make the online retail pricing different from traditional brick and mortar settings. First, due to the number of products the manager must be able to automate pricing. Second, an online retailer can make frequent price changes. Pricing differs from other areas of online marketing where experimentation is common, such as online advertising or website design, as firms do not randomize prices to different customers at the same time. In this paper we propose a dynamic price experimentation policy where the firm has incomplete demand information. For this general setting, we derive a pricing algorithm that balances earning profit immediately and learning for future profits. The proposed approach marries statistical machine learning and economic theory. In particular we combine multi-armed bandit (MAB) algorithms with partial identification of consumer demand into a unique pricing policy. Our automated policy solves this problem using a scalable distribution-free algorithm. We show that our method converges to the optimal price faster than standard machine learning MAB solutions to the problem. In a series of Monte Carlo simulations, we show that the proposed approach perform favorably compared to methods in computer science and revenue management. ∗Ross School of Business, University of Michigan. Contact: [email protected] †Rady School of Management, University of California, San Diego. Contact: [email protected] ‡Department of Electrical Engineering and Computer Science, University of Michigan. Contact: [email protected]
منابع مشابه
Optimal (R, Q) policy and pricing for two-echelon supply chain with lead time and retailer’s service-level incomplete information
Many studies focus on inventory systems to analyze different real-world situations. This paper considers a two-echelon supply chain that includes one warehouse and one retailer with stochastic demand and an up-to-level policy. The retailer’s lead time includes the transportation time from the warehouse to the retailer that is unknown to the retailer. On the other hand, the warehouse is unaware ...
متن کاملDynamic Online Pricing with Incomplete Information Using Multi-Armed Bandit Experiments
Consider the pricing decision for a manager at large online retailer, such as Amazon.com, that sells millions of products. The pricing manager must decide on real-time prices for each of these product. Due to the large number of products, the manager must set retail prices without complete demand information. A manager can run price experiments to learn about demand and maximize long run profit...
متن کاملAPPLICATION OF GAME THEORY IN DYNAMIC COMPETITIVE PRICING WITH ONE PRICE LEADER AND N DEPENDENT FOLLOWERS
In this research UF cheese pricing is considered and Pegah, Pak, Kaleh, Rouzaneh and Mihan firms’ data, as five main UF cheese competitive firms of Iran in breakfast cheese competitive market, is used. By using these firm’s sales data, production data and price of each ton of UF cheese in nineteen work-periods (each work-period is 6 months), their sales equations are estimated for each work-pe...
متن کاملIncentive-Based Pricing for Network Games with Complete and Incomplete Information
In this paper, we introduce the concept of dynamic pricing within the context of our previous Stackelberg network game model and view the ISP’s policy as an incentive policy, and the underlying game as a reverse Stackelberg game. We study this incentive-design problem under complete information as well as incomplete information. In both cases, we show that the game is not generally incentive co...
متن کاملA DSS-Based Dynamic Programming for Finding Optimal Markets Using Neural Networks and Pricing
One of the substantial challenges in marketing efforts is determining optimal markets, specifically in market segmentation. The problem is more controversial in electronic commerce and electronic marketing. Consumer behaviour is influenced by different factors and thus varies in different time periods. These dynamic impacts lead to the uncertain behaviour of consumers and therefore harden the t...
متن کامل